Schema-Agnostic Indexing with Azure DocumentDB

نویسندگان

  • Dharma Shukla
  • Shireesh Thota
  • Karthik Raman
  • Madhan Gajendran
  • Ankur Shah
  • Sergii Ziuzin
  • Krishnan Sundaram
  • Miguel Gonzalez Guajardo
  • Anna Wawrzyniak
  • Samer Boshra
  • Renato Ferreira
  • Mohamed Nassar
  • Michael Koltachev
  • Ji Huang
  • Sudipta Sengupta
  • Justin J. Levandoski
  • David B. Lomet
چکیده

Azure DocumentDB is Microsoft’s multi-tenant distributed database service for managing JSON documents at Internet scale. DocumentDB is now generally available to Azure developers. In this paper, we describe the DocumentDB indexing subsystem. DocumentDB indexing enables automatic indexing of documents without requiring a schema or secondary indices. Uniquely, DocumentDB provides real-time consistent queries in the face of very high rates of document updates. As a multi-tenant service, DocumentDB is designed to operate within extremely frugal resource budgets while providing predictable performance and robust resource isolation to its tenants. This paper describes the DocumentDB capabilities, including document representation, query language, document indexing approach, core index support, and early production experiences.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Schema Agnostic Indexing with Live Indexes

Now-a-days, schema is the most popular standardized language to describe data. Developers are working with applications that create massive volumes of new, rapidly changing data types — structured, semistructured, unstructured and polymorphic data. Long gone is the twelve-to-eighteen-month waterfall development cycle. Now small teams work in agile sprints, iterating quickly and pushing code eve...

متن کامل

How hard is this query? Measuring the Semantic Complexity of Schema-agnostic Queries

The growing size, heterogeneity and complexity of databases demand the creation of strategies to facilitate users and systems to consume data. Ideally, query mechanisms should be schema-agnostic, i.e. they should be able to match user queries in their own vocabulary and syntax to the data, abstracting data consumers from the representation of the data. This work provides an informationtheoretic...

متن کامل

Schema-agnostic vs Schema-based Configurations for Blocking Methods on Homogeneous Data

Entity Resolution constitutes a core task for data integration that, due to its quadratic complexity, typically scales to large datasets through blocking methods. These can be configured in two ways. The schema-based configuration relies on schema information in order to select signatures of high distinctiveness and low noise, while the schema-agnostic one treats every token from all attribute ...

متن کامل

On the Semantic Mapping of Schema-agnostic Queries: A Preliminary Study

The growing size, heterogeneity and complexity of databases demand the creation of strategies to facilitate users and systems to consume data. Ideally, query mechanisms should be schema-agnostic or vocabulary-independent, i.e. they should be able to match user queries in their own vocabulary and syntax to the data, abstracting data consumers from the representation of the data. Despite being a ...

متن کامل

Transaction Processing Techniques for Modern Hardware and the Cloud

The Deuteronomy architecture provides a clean separation of transaction functionality (performed in a transaction component, or TC) from data storage functionality (performed in a data component, or DC). For the past couple of years, we have been rethinking the implementation of both the TC and DC in order for them to perform as well, or better than, current high-performance database systems. T...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PVLDB

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2015